Cross Lingual Query Dependent Snippet Generation

نویسندگان

  • Pinaki Bhaskar
  • Sivaji Bandyopadhyay
چکیده

The present paper describes the development of a cross lingual query dependent snippet generation module. It is a language independent module, so it also performs as a multilingual snippet generation module. It is a module of the Cross Lingual Information Access (CLIA) system. This module takes the query and content of each retrieved document and generates a query dependent snippet for each retrieved document. It highlights all the query words, which appear in the generated snippet. The algorithm of this module based on the sentence extraction, sentence scoring and sentence ranking. Subjective evaluation has been done to evaluate the output of this module. English snippet got the best evaluation score, i.e. 1 and overall average evaluation score of 0.83 has been achieved in the scale of 0 to 1. Keywords— Snippet Generation, Summarization, Information Extraction, Information Retrieval, Cross Lingual, Multilingual.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bridging the Gap between Intrinsic and Perceived Relevance in Snippet Generation

Snippet generation plays an important role in a search engine. Good snippets provide users a good indication on the main content of a search result related to the query and on whether one can find relevant information in it. Previous studies on snippet generation focused on selecting sentences that are related to the query and to the document. However, resulting snippet may look highly relevant...

متن کامل

Ranking of Resulting Objects and Snippet Generation

Semantic web search engine Falcons support keyword based search for linked objects by using comprehensive virtual document which it creates for each object. In our work we are suggesting idea of using Selectivity Estimation of triple patterns for ranking of resulting objects and generating snippet for the keyword query for Falcons Semantic web search engine. Selectivity of a triple pattern is t...

متن کامل

Pseudo-relevance feedback and statistical query expansion for web snippet generation

a r t i c l e i n f o a b s t r a c t A (page or web) snippet is a document excerpt allowing a user to understand if a document is indeed relevant without accessing it. This paper proposes an effective snippet generation method. A statistical query expansion approach with pseudo-relevance feedback and text summarization techniques are applied to salient sentence extraction for good quality snip...

متن کامل

xLiD-Lexica: Cross-lingual Linked Data Lexica

In this paper, we introduce our cross-lingual linked data lexica, called xLiD-Lexica, which are constructed by exploiting the multilingual Wikipedia and linked data resources from Linked Open Data (LOD). We provide the cross-lingual groundings of linked data resources from LOD as RDF data, which can be easily integrated into the LOD data sources. In addition, we build a SPARQL endpoint over our...

متن کامل

Parsing the Wiki Collection and Snippet Generation A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Sai Subramanyam Chittilla IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE

Information Retrieval (IR) is a field which deals with retrieving useful information from large sets of data in response to a query. Much information in this digital age is stored in XML format, which associates a structure with a document. Though IR systems have been used for years to access documents, the field has greatly expanded with the emergence of the world wide web, which emphasizes th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012